Quagent control via Passive and Active Learning

نویسندگان

  • Rob Van Dam
  • Greg Briggs
چکیده

Artificial intelligence algorithms using passive and active learning versions of direct utility estimation, adaptive dynamic programming and temporal difference approaches to simulate an agent. The explored worlds consisted of discrete states (positions) bounded by internally generated “walls” that included one or more terminal states and a pre determined configuration of rewards for each state. Passive learning algorithms use a pre-calculated optimal movement policy to travel from the start state to the best terminal state. Active learning algorithms use an initially random movement policy and correct the known policy based on the percepts received within the world. In a passive learning scenario, all three approaches were found to be effective since the optimal policy was known. In active learning scenarios, direct utility estimation tends to result in unsolvable situations or less than optimal policies as a result of poor initial random that create unreachable areas. Adaptive dynamic programming is very effective in an active learning scenario but is inefficient in storage space and often fails to evaluate the entire map. Temporal difference learning approaches are very space efficient but often require more trial runs to approach the same level of accuracy that adaptive dynamic programming achieves. These different learning techniques can be tested with or without the use of quake and the Quagent bot.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effects of Cooperative Language Learning Strategies on Learning Active and Passive Structures among Iranian EFL Learners

This study aims at investigating the effects of cooperative language learning on learning active and passive structures among Iranian EFL students. The participants of the study were 60 high school students that were selected from third grade of Barikbin high school in Qazvin. All of the participants were male. Their level of proficiency was intermediate. Then the participants were divided into...

متن کامل

Comparison of Learning and Memory in Morphine Dependent Rats using Different Behavioral Models

There are several conflicting evidences showing the effect of morphine on learning and memory processes. In the present study the effect of chronic morphine administration on passive avoidance, active avoidance and spatial learning and memory of morphine dependent male rats using Passive Avoidance shuttle box and Morris Water Maze tasks were investigated, respectively. Male rats received morphi...

متن کامل

Comparison of Learning and Memory in Morphine Dependent Rats using Different Behavioral Models

There are several conflicting evidences showing the effect of morphine on learning and memory processes. In the present study the effect of chronic morphine administration on passive avoidance, active avoidance and spatial learning and memory of morphine dependent male rats using Passive Avoidance shuttle box and Morris Water Maze tasks were investigated, respectively. Male rats received morphi...

متن کامل

Comparison of Design Process in Student and Instructor

In this paper the designing products of B.A. Sophomore students of architecture in TehranUniversity who were divided into two kinds of learning namely technical and skill-based learning. In technical learningthe subjective steps of creativity process i.e. "insight", "preparation", "incubation", "intuition", and "verification"were discussed and it was suggested that these steps cannot be taught ...

متن کامل

Study of Numerical Processing Speed, Implicit and Explicit Memory, Active and Passive Memory, Conservation Abilities, and Visual-Spatial Skills of Students with Dyscalculia

Background and Purpose: Learning disorder is one of the common disorders in students, which can lead to the occurrence of educational problems and secondary disorders in them. Based on psychopathological criteria, dyscalculia is one of the subcategories of learning disorder. Children with this disorder have problems in perception of spatial relations and in different cognitive abilities. Theref...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004